Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 178 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 18.9 KiB |
| Average record size in memory | 108.7 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 1 |
alcohol is highly correlated with color_intensity and 1 other fields | High correlation |
malic_acid is highly correlated with hue | High correlation |
alcalinity_of_ash is highly correlated with target | High correlation |
total_phenols is highly correlated with flavanoids and 3 other fields | High correlation |
flavanoids is highly correlated with total_phenols and 5 other fields | High correlation |
nonflavanoid_phenols is highly correlated with flavanoids and 1 other fields | High correlation |
proanthocyanins is highly correlated with total_phenols and 2 other fields | High correlation |
color_intensity is highly correlated with alcohol and 1 other fields | High correlation |
hue is highly correlated with malic_acid and 4 other fields | High correlation |
od280/od315_of_diluted_wines is highly correlated with total_phenols and 5 other fields | High correlation |
proline is highly correlated with alcohol and 1 other fields | High correlation |
target is highly correlated with alcalinity_of_ash and 5 other fields | High correlation |
alcohol is highly correlated with color_intensity and 1 other fields | High correlation |
malic_acid is highly correlated with hue | High correlation |
alcalinity_of_ash is highly correlated with target | High correlation |
magnesium is highly correlated with proline | High correlation |
total_phenols is highly correlated with flavanoids and 3 other fields | High correlation |
flavanoids is highly correlated with total_phenols and 5 other fields | High correlation |
nonflavanoid_phenols is highly correlated with flavanoids | High correlation |
proanthocyanins is highly correlated with total_phenols and 3 other fields | High correlation |
color_intensity is highly correlated with alcohol | High correlation |
hue is highly correlated with malic_acid and 2 other fields | High correlation |
od280/od315_of_diluted_wines is highly correlated with total_phenols and 3 other fields | High correlation |
proline is highly correlated with alcohol and 2 other fields | High correlation |
target is highly correlated with alcalinity_of_ash and 6 other fields | High correlation |
total_phenols is highly correlated with flavanoids and 1 other fields | High correlation |
flavanoids is highly correlated with total_phenols and 3 other fields | High correlation |
proanthocyanins is highly correlated with flavanoids | High correlation |
od280/od315_of_diluted_wines is highly correlated with flavanoids and 1 other fields | High correlation |
target is highly correlated with total_phenols and 2 other fields | High correlation |
flavanoids is highly correlated with color_intensity and 6 other fields | High correlation |
ash is highly correlated with alcalinity_of_ash | High correlation |
color_intensity is highly correlated with flavanoids and 4 other fields | High correlation |
proline is highly correlated with flavanoids and 7 other fields | High correlation |
total_phenols is highly correlated with flavanoids and 5 other fields | High correlation |
proanthocyanins is highly correlated with flavanoids and 6 other fields | High correlation |
hue is highly correlated with flavanoids and 3 other fields | High correlation |
alcohol is highly correlated with color_intensity and 4 other fields | High correlation |
malic_acid is highly correlated with total_phenols and 1 other fields | High correlation |
magnesium is highly correlated with proline and 2 other fields | High correlation |
alcalinity_of_ash is highly correlated with ash and 3 other fields | High correlation |
od280/od315_of_diluted_wines is highly correlated with flavanoids and 5 other fields | High correlation |
target is highly correlated with flavanoids and 11 other fields | High correlation |
nonflavanoid_phenols is highly correlated with od280/od315_of_diluted_wines and 1 other fields | High correlation |
Reproduction
| Analysis started | 2021-08-03 19:47:21.881525 |
|---|---|
| Analysis finished | 2021-08-03 19:47:53.018006 |
| Duration | 31.14 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 126 |
|---|---|
| Distinct (%) | 70.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.00061798 |
| Minimum | 11.03 |
|---|---|
| Maximum | 14.83 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 11.03 |
|---|---|
| 5-th percentile | 11.6585 |
| Q1 | 12.3625 |
| median | 13.05 |
| Q3 | 13.6775 |
| 95-th percentile | 14.2215 |
| Maximum | 14.83 |
| Range | 3.8 |
| Interquartile range (IQR) | 1.315 |
Descriptive statistics
| Standard deviation | 0.811826538 |
|---|---|
| Coefficient of variation (CV) | 0.06244522679 |
| Kurtosis | -0.8524995685 |
| Mean | 13.00061798 |
| Median Absolute Deviation (MAD) | 0.68 |
| Skewness | -0.05148233108 |
| Sum | 2314.11 |
| Variance | 0.6590623278 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 12.37 | 6 | 3.4% |
| 13.05 | 6 | 3.4% |
| 12.08 | 5 | 2.8% |
| 12.29 | 4 | 2.2% |
| 12 | 3 | 1.7% |
| 12.25 | 3 | 1.7% |
| 12.42 | 3 | 1.7% |
| 12.93 | 2 | 1.1% |
| 12.6 | 2 | 1.1% |
| 12.85 | 2 | 1.1% |
| Other values (116) | 142 |
| Value | Count | Frequency (%) |
| 11.03 | 1 | |
| 11.41 | 1 | |
| 11.45 | 1 | |
| 11.46 | 1 | |
| 11.56 | 1 | |
| 11.61 | 1 | |
| 11.62 | 1 | |
| 11.64 | 1 | |
| 11.65 | 1 | |
| 11.66 | 1 |
| Value | Count | Frequency (%) |
| 14.83 | 1 | |
| 14.75 | 1 | |
| 14.39 | 1 | |
| 14.38 | 2 | |
| 14.37 | 1 | |
| 14.34 | 1 | |
| 14.3 | 1 | |
| 14.23 | 1 | |
| 14.22 | 2 | |
| 14.21 | 1 |
| Distinct | 133 |
|---|---|
| Distinct (%) | 74.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.336348315 |
| Minimum | 0.74 |
|---|---|
| Maximum | 5.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0.74 |
|---|---|
| 5-th percentile | 1.061 |
| Q1 | 1.6025 |
| median | 1.865 |
| Q3 | 3.0825 |
| 95-th percentile | 4.4555 |
| Maximum | 5.8 |
| Range | 5.06 |
| Interquartile range (IQR) | 1.48 |
Descriptive statistics
| Standard deviation | 1.117146098 |
|---|---|
| Coefficient of variation (CV) | 0.478159053 |
| Kurtosis | 0.2992066799 |
| Mean | 2.336348315 |
| Median Absolute Deviation (MAD) | 0.52 |
| Skewness | 1.039651193 |
| Sum | 415.87 |
| Variance | 1.248015403 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.73 | 7 | 3.9% |
| 1.81 | 4 | 2.2% |
| 1.67 | 4 | 2.2% |
| 1.68 | 3 | 1.7% |
| 1.61 | 3 | 1.7% |
| 1.51 | 3 | 1.7% |
| 1.35 | 3 | 1.7% |
| 1.53 | 3 | 1.7% |
| 1.9 | 3 | 1.7% |
| 3.17 | 2 | 1.1% |
| Other values (123) | 143 |
| Value | Count | Frequency (%) |
| 0.74 | 1 | |
| 0.89 | 1 | |
| 0.9 | 1 | |
| 0.92 | 1 | |
| 0.94 | 2 | |
| 0.98 | 1 | |
| 0.99 | 1 | |
| 1.01 | 1 | |
| 1.07 | 1 | |
| 1.09 | 1 |
| Value | Count | Frequency (%) |
| 5.8 | 1 | |
| 5.65 | 1 | |
| 5.51 | 1 | |
| 5.19 | 1 | |
| 5.04 | 1 | |
| 4.95 | 1 | |
| 4.72 | 1 | |
| 4.61 | 1 | |
| 4.6 | 1 | |
| 4.43 | 1 |
| Distinct | 79 |
|---|---|
| Distinct (%) | 44.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.366516854 |
| Minimum | 1.36 |
|---|---|
| Maximum | 3.23 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 1.36 |
|---|---|
| 5-th percentile | 1.92 |
| Q1 | 2.21 |
| median | 2.36 |
| Q3 | 2.5575 |
| 95-th percentile | 2.7415 |
| Maximum | 3.23 |
| Range | 1.87 |
| Interquartile range (IQR) | 0.3475 |
Descriptive statistics
| Standard deviation | 0.2743440091 |
|---|---|
| Coefficient of variation (CV) | 0.1159273422 |
| Kurtosis | 1.143978169 |
| Mean | 2.366516854 |
| Median Absolute Deviation (MAD) | 0.16 |
| Skewness | -0.1766993165 |
| Sum | 421.24 |
| Variance | 0.07526463531 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.3 | 7 | 3.9% |
| 2.28 | 7 | 3.9% |
| 2.7 | 6 | 3.4% |
| 2.36 | 6 | 3.4% |
| 2.32 | 6 | 3.4% |
| 2.48 | 5 | 2.8% |
| 2.2 | 5 | 2.8% |
| 2.38 | 5 | 2.8% |
| 2.5 | 4 | 2.2% |
| 2.4 | 4 | 2.2% |
| Other values (69) | 123 |
| Value | Count | Frequency (%) |
| 1.36 | 1 | 0.6% |
| 1.7 | 2 | |
| 1.71 | 1 | 0.6% |
| 1.75 | 1 | 0.6% |
| 1.82 | 1 | 0.6% |
| 1.88 | 1 | 0.6% |
| 1.9 | 1 | 0.6% |
| 1.92 | 3 | |
| 1.94 | 1 | 0.6% |
| 1.95 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 3.23 | 1 | |
| 3.22 | 1 | |
| 2.92 | 1 | |
| 2.87 | 1 | |
| 2.86 | 1 | |
| 2.84 | 1 | |
| 2.8 | 1 | |
| 2.78 | 1 | |
| 2.75 | 1 | |
| 2.74 | 2 |
| Distinct | 63 |
|---|---|
| Distinct (%) | 35.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.49494382 |
| Minimum | 10.6 |
|---|---|
| Maximum | 30 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 10.6 |
|---|---|
| 5-th percentile | 14.77 |
| Q1 | 17.2 |
| median | 19.5 |
| Q3 | 21.5 |
| 95-th percentile | 25 |
| Maximum | 30 |
| Range | 19.4 |
| Interquartile range (IQR) | 4.3 |
Descriptive statistics
| Standard deviation | 3.339563767 |
|---|---|
| Coefficient of variation (CV) | 0.171304098 |
| Kurtosis | 0.4879415405 |
| Mean | 19.49494382 |
| Median Absolute Deviation (MAD) | 2.05 |
| Skewness | 0.2130468864 |
| Sum | 3470.1 |
| Variance | 11.15268616 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 20 | 15 | 8.4% |
| 21 | 11 | 6.2% |
| 16 | 11 | 6.2% |
| 18 | 10 | 5.6% |
| 19 | 9 | 5.1% |
| 21.5 | 8 | 4.5% |
| 18.5 | 7 | 3.9% |
| 22 | 7 | 3.9% |
| 19.5 | 7 | 3.9% |
| 22.5 | 7 | 3.9% |
| Other values (53) | 86 |
| Value | Count | Frequency (%) |
| 10.6 | 1 | |
| 11.2 | 1 | |
| 11.4 | 1 | |
| 12 | 1 | |
| 12.4 | 1 | |
| 13.2 | 1 | |
| 14 | 2 | |
| 14.6 | 1 | |
| 14.8 | 1 | |
| 15 | 2 |
| Value | Count | Frequency (%) |
| 30 | 1 | 0.6% |
| 28.5 | 2 | 1.1% |
| 27 | 1 | 0.6% |
| 26.5 | 1 | 0.6% |
| 26 | 1 | 0.6% |
| 25.5 | 1 | 0.6% |
| 25 | 5 | |
| 24.5 | 3 | |
| 24 | 5 | |
| 23.6 | 1 | 0.6% |
| Distinct | 53 |
|---|---|
| Distinct (%) | 29.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.74157303 |
| Minimum | 70 |
|---|---|
| Maximum | 162 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 70 |
|---|---|
| 5-th percentile | 80.85 |
| Q1 | 88 |
| median | 98 |
| Q3 | 107 |
| 95-th percentile | 124.3 |
| Maximum | 162 |
| Range | 92 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 14.28248352 |
|---|---|
| Coefficient of variation (CV) | 0.1431948894 |
| Kurtosis | 2.104991324 |
| Mean | 99.74157303 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.098191055 |
| Sum | 17754 |
| Variance | 203.9893354 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 88 | 13 | 7.3% |
| 86 | 11 | 6.2% |
| 98 | 9 | 5.1% |
| 101 | 9 | 5.1% |
| 96 | 8 | 4.5% |
| 102 | 7 | 3.9% |
| 112 | 6 | 3.4% |
| 85 | 6 | 3.4% |
| 94 | 6 | 3.4% |
| 80 | 5 | 2.8% |
| Other values (43) | 98 |
| Value | Count | Frequency (%) |
| 70 | 1 | 0.6% |
| 78 | 3 | 1.7% |
| 80 | 5 | 2.8% |
| 81 | 1 | 0.6% |
| 82 | 1 | 0.6% |
| 84 | 3 | 1.7% |
| 85 | 6 | |
| 86 | 11 | |
| 87 | 3 | 1.7% |
| 88 | 13 |
| Value | Count | Frequency (%) |
| 162 | 1 | |
| 151 | 1 | |
| 139 | 1 | |
| 136 | 1 | |
| 134 | 1 | |
| 132 | 1 | |
| 128 | 1 | |
| 127 | 1 | |
| 126 | 1 | |
| 124 | 1 |
| Distinct | 97 |
|---|---|
| Distinct (%) | 54.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.29511236 |
| Minimum | 0.98 |
|---|---|
| Maximum | 3.88 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0.98 |
|---|---|
| 5-th percentile | 1.38 |
| Q1 | 1.7425 |
| median | 2.355 |
| Q3 | 2.8 |
| 95-th percentile | 3.2745 |
| Maximum | 3.88 |
| Range | 2.9 |
| Interquartile range (IQR) | 1.0575 |
Descriptive statistics
| Standard deviation | 0.6258510488 |
|---|---|
| Coefficient of variation (CV) | 0.2726886317 |
| Kurtosis | -0.8356265234 |
| Mean | 2.29511236 |
| Median Absolute Deviation (MAD) | 0.505 |
| Skewness | 0.0866385864 |
| Sum | 408.53 |
| Variance | 0.3916895353 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.2 | 8 | 4.5% |
| 3 | 6 | 3.4% |
| 2.8 | 6 | 3.4% |
| 2.6 | 6 | 3.4% |
| 2 | 5 | 2.8% |
| 2.95 | 5 | 2.8% |
| 1.38 | 4 | 2.2% |
| 1.65 | 4 | 2.2% |
| 2.45 | 4 | 2.2% |
| 2.85 | 4 | 2.2% |
| Other values (87) | 126 |
| Value | Count | Frequency (%) |
| 0.98 | 1 | 0.6% |
| 1.1 | 1 | 0.6% |
| 1.15 | 1 | 0.6% |
| 1.25 | 1 | 0.6% |
| 1.28 | 1 | 0.6% |
| 1.3 | 1 | 0.6% |
| 1.35 | 1 | 0.6% |
| 1.38 | 4 | |
| 1.39 | 2 | |
| 1.4 | 2 |
| Value | Count | Frequency (%) |
| 3.88 | 1 | 0.6% |
| 3.85 | 1 | 0.6% |
| 3.52 | 1 | 0.6% |
| 3.5 | 1 | 0.6% |
| 3.4 | 1 | 0.6% |
| 3.38 | 1 | 0.6% |
| 3.3 | 3 | |
| 3.27 | 1 | 0.6% |
| 3.25 | 2 | |
| 3.2 | 1 | 0.6% |
| Distinct | 132 |
|---|---|
| Distinct (%) | 74.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.029269663 |
| Minimum | 0.34 |
|---|---|
| Maximum | 5.08 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0.34 |
|---|---|
| 5-th percentile | 0.5455 |
| Q1 | 1.205 |
| median | 2.135 |
| Q3 | 2.875 |
| 95-th percentile | 3.4975 |
| Maximum | 5.08 |
| Range | 4.74 |
| Interquartile range (IQR) | 1.67 |
Descriptive statistics
| Standard deviation | 0.998858685 |
|---|---|
| Coefficient of variation (CV) | 0.4922257023 |
| Kurtosis | -0.8803815472 |
| Mean | 2.029269663 |
| Median Absolute Deviation (MAD) | 0.835 |
| Skewness | 0.02534355338 |
| Sum | 361.21 |
| Variance | 0.9977186726 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.65 | 4 | 2.2% |
| 0.58 | 3 | 1.7% |
| 2.68 | 3 | 1.7% |
| 0.6 | 3 | 1.7% |
| 1.25 | 3 | 1.7% |
| 2.03 | 3 | 1.7% |
| 0.92 | 2 | 1.1% |
| 0.66 | 2 | 1.1% |
| 2.43 | 2 | 1.1% |
| 2.98 | 2 | 1.1% |
| Other values (122) | 151 |
| Value | Count | Frequency (%) |
| 0.34 | 1 | |
| 0.47 | 2 | |
| 0.48 | 1 | |
| 0.49 | 1 | |
| 0.5 | 2 | |
| 0.51 | 1 | |
| 0.52 | 1 | |
| 0.55 | 1 | |
| 0.56 | 1 | |
| 0.57 | 1 |
| Value | Count | Frequency (%) |
| 5.08 | 1 | |
| 3.93 | 1 | |
| 3.75 | 1 | |
| 3.74 | 1 | |
| 3.69 | 1 | |
| 3.67 | 1 | |
| 3.64 | 1 | |
| 3.56 | 1 | |
| 3.54 | 1 | |
| 3.49 | 1 |
| Distinct | 39 |
|---|---|
| Distinct (%) | 21.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3618539326 |
| Minimum | 0.13 |
|---|---|
| Maximum | 0.66 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0.13 |
|---|---|
| 5-th percentile | 0.19 |
| Q1 | 0.27 |
| median | 0.34 |
| Q3 | 0.4375 |
| 95-th percentile | 0.6 |
| Maximum | 0.66 |
| Range | 0.53 |
| Interquartile range (IQR) | 0.1675 |
Descriptive statistics
| Standard deviation | 0.1244533403 |
|---|---|
| Coefficient of variation (CV) | 0.3439325349 |
| Kurtosis | -0.6371910641 |
| Mean | 0.3618539326 |
| Median Absolute Deviation (MAD) | 0.085 |
| Skewness | 0.4501513356 |
| Sum | 64.41 |
| Variance | 0.01548863391 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=39)
| Value | Count | Frequency (%) |
| 0.26 | 11 | 6.2% |
| 0.43 | 11 | 6.2% |
| 0.29 | 10 | 5.6% |
| 0.32 | 9 | 5.1% |
| 0.3 | 8 | 4.5% |
| 0.37 | 8 | 4.5% |
| 0.34 | 8 | 4.5% |
| 0.27 | 8 | 4.5% |
| 0.4 | 8 | 4.5% |
| 0.24 | 7 | 3.9% |
| Other values (29) | 90 |
| Value | Count | Frequency (%) |
| 0.13 | 1 | 0.6% |
| 0.14 | 2 | 1.1% |
| 0.17 | 5 | |
| 0.19 | 2 | 1.1% |
| 0.2 | 2 | 1.1% |
| 0.21 | 6 | |
| 0.22 | 6 | |
| 0.24 | 7 | |
| 0.25 | 2 | 1.1% |
| 0.26 | 11 |
| Value | Count | Frequency (%) |
| 0.66 | 1 | 0.6% |
| 0.63 | 4 | |
| 0.61 | 3 | |
| 0.6 | 3 | |
| 0.58 | 3 | |
| 0.56 | 1 | 0.6% |
| 0.55 | 1 | 0.6% |
| 0.53 | 7 | |
| 0.52 | 5 | |
| 0.5 | 5 |
proanthocyanins
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 101 |
|---|---|
| Distinct (%) | 56.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.590898876 |
| Minimum | 0.41 |
|---|---|
| Maximum | 3.58 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0.41 |
|---|---|
| 5-th percentile | 0.73 |
| Q1 | 1.25 |
| median | 1.555 |
| Q3 | 1.95 |
| 95-th percentile | 2.709 |
| Maximum | 3.58 |
| Range | 3.17 |
| Interquartile range (IQR) | 0.7 |
Descriptive statistics
| Standard deviation | 0.5723588627 |
|---|---|
| Coefficient of variation (CV) | 0.3597707379 |
| Kurtosis | 0.5546485226 |
| Mean | 1.590898876 |
| Median Absolute Deviation (MAD) | 0.38 |
| Skewness | 0.5171371723 |
| Sum | 283.18 |
| Variance | 0.3275946677 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.35 | 9 | 5.1% |
| 1.46 | 7 | 3.9% |
| 1.87 | 6 | 3.4% |
| 1.25 | 5 | 2.8% |
| 1.56 | 4 | 2.2% |
| 1.66 | 4 | 2.2% |
| 1.98 | 4 | 2.2% |
| 2.08 | 4 | 2.2% |
| 1.77 | 3 | 1.7% |
| 1.63 | 3 | 1.7% |
| Other values (91) | 129 |
| Value | Count | Frequency (%) |
| 0.41 | 1 | |
| 0.42 | 2 | |
| 0.55 | 1 | |
| 0.62 | 1 | |
| 0.64 | 2 | |
| 0.68 | 1 | |
| 0.73 | 2 | |
| 0.75 | 1 | |
| 0.8 | 2 | |
| 0.81 | 1 |
| Value | Count | Frequency (%) |
| 3.58 | 1 | 0.6% |
| 3.28 | 1 | 0.6% |
| 2.96 | 1 | 0.6% |
| 2.91 | 2 | |
| 2.81 | 3 | |
| 2.76 | 1 | 0.6% |
| 2.7 | 1 | 0.6% |
| 2.5 | 1 | 0.6% |
| 2.49 | 1 | 0.6% |
| 2.45 | 1 | 0.6% |
| Distinct | 132 |
|---|---|
| Distinct (%) | 74.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.058089882 |
| Minimum | 1.28 |
|---|---|
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 1.28 |
|---|---|
| 5-th percentile | 2.114 |
| Q1 | 3.22 |
| median | 4.69 |
| Q3 | 6.2 |
| 95-th percentile | 9.598 |
| Maximum | 13 |
| Range | 11.72 |
| Interquartile range (IQR) | 2.98 |
Descriptive statistics
| Standard deviation | 2.318285872 |
|---|---|
| Coefficient of variation (CV) | 0.4583322807 |
| Kurtosis | 0.3815222728 |
| Mean | 5.058089882 |
| Median Absolute Deviation (MAD) | 1.51 |
| Skewness | 0.868584791 |
| Sum | 900.339999 |
| Variance | 5.374449383 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.6 | 4 | 2.2% |
| 4.6 | 4 | 2.2% |
| 3.8 | 4 | 2.2% |
| 3.4 | 3 | 1.7% |
| 3.05 | 3 | 1.7% |
| 2.9 | 3 | 1.7% |
| 5 | 3 | 1.7% |
| 4.5 | 3 | 1.7% |
| 5.7 | 3 | 1.7% |
| 2.8 | 3 | 1.7% |
| Other values (122) | 145 |
| Value | Count | Frequency (%) |
| 1.28 | 1 | |
| 1.74 | 1 | |
| 1.9 | 1 | |
| 1.95 | 2 | |
| 2 | 1 | |
| 2.06 | 2 | |
| 2.08 | 1 | |
| 2.12 | 1 | |
| 2.15 | 1 | |
| 2.2 | 1 |
| Value | Count | Frequency (%) |
| 13 | 1 | |
| 11.75 | 1 | |
| 10.8 | 1 | |
| 10.68 | 1 | |
| 10.52 | 1 | |
| 10.26 | 1 | |
| 10.2 | 1 | |
| 9.899999 | 1 | |
| 9.7 | 1 | |
| 9.58 | 1 |
| Distinct | 78 |
|---|---|
| Distinct (%) | 43.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9574494382 |
| Minimum | 0.48 |
|---|---|
| Maximum | 1.71 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0.48 |
|---|---|
| 5-th percentile | 0.57 |
| Q1 | 0.7825 |
| median | 0.965 |
| Q3 | 1.12 |
| 95-th percentile | 1.2845 |
| Maximum | 1.71 |
| Range | 1.23 |
| Interquartile range (IQR) | 0.3375 |
Descriptive statistics
| Standard deviation | 0.2285715658 |
|---|---|
| Coefficient of variation (CV) | 0.2387296464 |
| Kurtosis | -0.3440957414 |
| Mean | 0.9574494382 |
| Median Absolute Deviation (MAD) | 0.165 |
| Skewness | 0.0210912722 |
| Sum | 170.426 |
| Variance | 0.05224496071 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.04 | 8 | 4.5% |
| 1.23 | 7 | 3.9% |
| 1.12 | 6 | 3.4% |
| 0.89 | 5 | 2.8% |
| 0.57 | 5 | 2.8% |
| 0.96 | 5 | 2.8% |
| 1.25 | 5 | 2.8% |
| 1.05 | 4 | 2.2% |
| 1.09 | 4 | 2.2% |
| 0.75 | 4 | 2.2% |
| Other values (68) | 125 |
| Value | Count | Frequency (%) |
| 0.48 | 1 | 0.6% |
| 0.54 | 1 | 0.6% |
| 0.55 | 1 | 0.6% |
| 0.56 | 2 | 1.1% |
| 0.57 | 5 | |
| 0.58 | 2 | 1.1% |
| 0.59 | 2 | 1.1% |
| 0.6 | 3 | |
| 0.61 | 2 | 1.1% |
| 0.62 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 1.71 | 1 | 0.6% |
| 1.45 | 1 | 0.6% |
| 1.42 | 1 | 0.6% |
| 1.38 | 1 | 0.6% |
| 1.36 | 2 | 1.1% |
| 1.33 | 1 | 0.6% |
| 1.31 | 2 | 1.1% |
| 1.28 | 2 | 1.1% |
| 1.27 | 1 | 0.6% |
| 1.25 | 5 |
od280/od315_of_diluted_wines
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 122 |
|---|---|
| Distinct (%) | 68.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.611685393 |
| Minimum | 1.27 |
|---|---|
| Maximum | 4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 1.27 |
|---|---|
| 5-th percentile | 1.4625 |
| Q1 | 1.9375 |
| median | 2.78 |
| Q3 | 3.17 |
| 95-th percentile | 3.58 |
| Maximum | 4 |
| Range | 2.73 |
| Interquartile range (IQR) | 1.2325 |
Descriptive statistics
| Standard deviation | 0.7099904288 |
|---|---|
| Coefficient of variation (CV) | 0.2718514376 |
| Kurtosis | -1.086434527 |
| Mean | 2.611685393 |
| Median Absolute Deviation (MAD) | 0.52 |
| Skewness | -0.307285499 |
| Sum | 464.88 |
| Variance | 0.5040864089 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.87 | 5 | 2.8% |
| 3 | 4 | 2.2% |
| 1.82 | 4 | 2.2% |
| 2.78 | 4 | 2.2% |
| 2.77 | 3 | 1.7% |
| 1.75 | 3 | 1.7% |
| 1.33 | 3 | 1.7% |
| 2.31 | 3 | 1.7% |
| 3.33 | 3 | 1.7% |
| 2.96 | 3 | 1.7% |
| Other values (112) | 143 |
| Value | Count | Frequency (%) |
| 1.27 | 1 | 0.6% |
| 1.29 | 2 | |
| 1.3 | 1 | 0.6% |
| 1.33 | 3 | |
| 1.36 | 1 | 0.6% |
| 1.42 | 1 | 0.6% |
| 1.47 | 1 | 0.6% |
| 1.48 | 1 | 0.6% |
| 1.51 | 2 | |
| 1.55 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 4 | 1 | |
| 3.92 | 1 | |
| 3.82 | 1 | |
| 3.71 | 1 | |
| 3.69 | 1 | |
| 3.64 | 1 | |
| 3.63 | 1 | |
| 3.59 | 1 | |
| 3.58 | 2 | |
| 3.57 | 1 |
| Distinct | 121 |
|---|---|
| Distinct (%) | 68.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 746.8932584 |
| Minimum | 278 |
|---|---|
| Maximum | 1680 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 278 |
|---|---|
| 5-th percentile | 354.55 |
| Q1 | 500.5 |
| median | 673.5 |
| Q3 | 985 |
| 95-th percentile | 1297.25 |
| Maximum | 1680 |
| Range | 1402 |
| Interquartile range (IQR) | 484.5 |
Descriptive statistics
| Standard deviation | 314.9074743 |
|---|---|
| Coefficient of variation (CV) | 0.4216231312 |
| Kurtosis | -0.2484031061 |
| Mean | 746.8932584 |
| Median Absolute Deviation (MAD) | 202.5 |
| Skewness | 0.7678217814 |
| Sum | 132947 |
| Variance | 99166.71736 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 680 | 5 | 2.8% |
| 520 | 5 | 2.8% |
| 750 | 4 | 2.2% |
| 630 | 4 | 2.2% |
| 625 | 4 | 2.2% |
| 495 | 3 | 1.7% |
| 562 | 3 | 1.7% |
| 450 | 3 | 1.7% |
| 480 | 3 | 1.7% |
| 660 | 3 | 1.7% |
| Other values (111) | 141 |
| Value | Count | Frequency (%) |
| 278 | 1 | |
| 290 | 1 | |
| 312 | 1 | |
| 315 | 1 | |
| 325 | 1 | |
| 342 | 1 | |
| 345 | 2 | |
| 352 | 1 | |
| 355 | 1 | |
| 365 | 1 |
| Value | Count | Frequency (%) |
| 1680 | 1 | |
| 1547 | 1 | |
| 1515 | 1 | |
| 1510 | 1 | |
| 1480 | 1 | |
| 1450 | 1 | |
| 1375 | 1 | |
| 1320 | 1 | |
| 1310 | 1 | |
| 1295 | 1 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 KiB |
| 1 | |
|---|---|
| 0 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 178 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 71 | |
| 0 | 59 | |
| 2 | 48 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 1 | 71 | |
| 0 | 59 | |
| 2 | 48 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 71 | |
| 0 | 59 | |
| 2 | 48 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 178 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 71 | |
| 0 | 59 | |
| 2 | 48 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 178 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 71 | |
| 0 | 59 | |
| 2 | 48 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 71 | |
| 0 | 59 | |
| 2 | 48 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| alcohol | malic_acid | ash | alcalinity_of_ash | magnesium | total_phenols | flavanoids | nonflavanoid_phenols | proanthocyanins | color_intensity | hue | od280/od315_of_diluted_wines | proline | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 14.23 | 1.71 | 2.43 | 15.6 | 127.0 | 2.80 | 3.06 | 0.28 | 2.29 | 5.64 | 1.04 | 3.92 | 1065.0 | 0 |
| 1 | 13.20 | 1.78 | 2.14 | 11.2 | 100.0 | 2.65 | 2.76 | 0.26 | 1.28 | 4.38 | 1.05 | 3.40 | 1050.0 | 0 |
| 2 | 13.16 | 2.36 | 2.67 | 18.6 | 101.0 | 2.80 | 3.24 | 0.30 | 2.81 | 5.68 | 1.03 | 3.17 | 1185.0 | 0 |
| 3 | 14.37 | 1.95 | 2.50 | 16.8 | 113.0 | 3.85 | 3.49 | 0.24 | 2.18 | 7.80 | 0.86 | 3.45 | 1480.0 | 0 |
| 4 | 13.24 | 2.59 | 2.87 | 21.0 | 118.0 | 2.80 | 2.69 | 0.39 | 1.82 | 4.32 | 1.04 | 2.93 | 735.0 | 0 |
| 5 | 14.20 | 1.76 | 2.45 | 15.2 | 112.0 | 3.27 | 3.39 | 0.34 | 1.97 | 6.75 | 1.05 | 2.85 | 1450.0 | 0 |
| 6 | 14.39 | 1.87 | 2.45 | 14.6 | 96.0 | 2.50 | 2.52 | 0.30 | 1.98 | 5.25 | 1.02 | 3.58 | 1290.0 | 0 |
| 7 | 14.06 | 2.15 | 2.61 | 17.6 | 121.0 | 2.60 | 2.51 | 0.31 | 1.25 | 5.05 | 1.06 | 3.58 | 1295.0 | 0 |
| 8 | 14.83 | 1.64 | 2.17 | 14.0 | 97.0 | 2.80 | 2.98 | 0.29 | 1.98 | 5.20 | 1.08 | 2.85 | 1045.0 | 0 |
| 9 | 13.86 | 1.35 | 2.27 | 16.0 | 98.0 | 2.98 | 3.15 | 0.22 | 1.85 | 7.22 | 1.01 | 3.55 | 1045.0 | 0 |
Last rows
| alcohol | malic_acid | ash | alcalinity_of_ash | magnesium | total_phenols | flavanoids | nonflavanoid_phenols | proanthocyanins | color_intensity | hue | od280/od315_of_diluted_wines | proline | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 168 | 13.58 | 2.58 | 2.69 | 24.5 | 105.0 | 1.55 | 0.84 | 0.39 | 1.54 | 8.660000 | 0.74 | 1.80 | 750.0 | 2 |
| 169 | 13.40 | 4.60 | 2.86 | 25.0 | 112.0 | 1.98 | 0.96 | 0.27 | 1.11 | 8.500000 | 0.67 | 1.92 | 630.0 | 2 |
| 170 | 12.20 | 3.03 | 2.32 | 19.0 | 96.0 | 1.25 | 0.49 | 0.40 | 0.73 | 5.500000 | 0.66 | 1.83 | 510.0 | 2 |
| 171 | 12.77 | 2.39 | 2.28 | 19.5 | 86.0 | 1.39 | 0.51 | 0.48 | 0.64 | 9.899999 | 0.57 | 1.63 | 470.0 | 2 |
| 172 | 14.16 | 2.51 | 2.48 | 20.0 | 91.0 | 1.68 | 0.70 | 0.44 | 1.24 | 9.700000 | 0.62 | 1.71 | 660.0 | 2 |
| 173 | 13.71 | 5.65 | 2.45 | 20.5 | 95.0 | 1.68 | 0.61 | 0.52 | 1.06 | 7.700000 | 0.64 | 1.74 | 740.0 | 2 |
| 174 | 13.40 | 3.91 | 2.48 | 23.0 | 102.0 | 1.80 | 0.75 | 0.43 | 1.41 | 7.300000 | 0.70 | 1.56 | 750.0 | 2 |
| 175 | 13.27 | 4.28 | 2.26 | 20.0 | 120.0 | 1.59 | 0.69 | 0.43 | 1.35 | 10.200000 | 0.59 | 1.56 | 835.0 | 2 |
| 176 | 13.17 | 2.59 | 2.37 | 20.0 | 120.0 | 1.65 | 0.68 | 0.53 | 1.46 | 9.300000 | 0.60 | 1.62 | 840.0 | 2 |
| 177 | 14.13 | 4.10 | 2.74 | 24.5 | 96.0 | 2.05 | 0.76 | 0.56 | 1.35 | 9.200000 | 0.61 | 1.60 | 560.0 | 2 |